We claim; 

1. A method for generating a table of contents for a document using only information in 
said document, comprising the steps of: 

building a model of said document including an initial semantic structure; 
detecting changes in said semantic structure spanning different scales; and 
ordering said changes into entries in said table of contents based on scale span. 

2. The method of claim 1 wherein said table of contents is a hierarchical sequential 
description of topic changes in said document 

3. The method of claim 1 wherein said document includes at least one of the following: 
a text file, an audio file, a video file, a multimedia presentation. 

4. The method of claim 3 wherein said audio file includes music. 

5. The method of claim 3 wherein said audio file includes speech. 

6. The method of claim 3 wherein said text file is an audio transcript 

7. The method of claim 3 wherein frames in said video file are modeled by a number 
representing color intensity data. 
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8. The method of claim 3 wherein said model combines audio data and video data into a 
single unified document representation. 

9. The method of claim 8 wherein said video data is scaled to have similar influence in 
said model as said audio data. 

10. The method of claim 1 wherein said building step comprises the further steps of: 

defining a vector of terms occurring in said document; and 
mapping said document into a vector space by projecting scaled term occurrence 
histogram data onto said vector of terms. 

1 L The method of claim 10 wherein said terms include at least one of: words, phrases, 
sentences, paragraphs, shots in video data. 

12. The method of claim 1 1 wherein said terms are locally and globally weighted. 

13. The method of claim 12 wherein said local weighting includes the log of term 
frequency plus one, and said global weighting includes term firequency entropy 
weighting. 
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1 14. The method of claim 10 wherein said mapping step includes the further step of 

2 summarizing said terms using singular- value decomposition. 

1 15. The method of claim 1 wherein said detecting step comprises the further steps of: 

2 applying successively smaller scale filter windows to said model according to said 

3 initial semantic structure to construct a map of said changes versus scale; 

4 identifying local peaks in said contour map, said peaks being points of maximum 

5 vector derivative magnitude; 

6 tracing said local peaks back to a semantic structure change origin point; and 

7 measuring a span of scales over which each said change exists. 

1 16. The method of claim 15 wherein said filter windows are Gaussian. 

1 17. A system for generating a table of contents for a document using only information in 

2 said document, comprising: 

3 means for building a model of said document including an initial semantic 

4 structure; 

5 means for detecting changes in said semantic stracture spanning different scales; 

6 and 

7 means for ordering said changes into entries in said table of contents based on 

8 scale span. 
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18. A computer program product comprising a machine-readable medium having 
computer-executable program instructions tfiereon for generating a table of contents for a 
document using only information in said document, including: 

a first code means for building a model of said document including an initial 
semantic structure; 

a second code means for detecting changes in said semantic structure spanning 
different scales; and 

a third code means for ordering said changes into entries in said table of contents 
based on scale span. 
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